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Abstract 

It is an intriguing question to see what kind of information on the structure of 
an oriented graph D one can obtain if D does not contain a fixed oriented graph 
H as a subgraph. The related question in the unoriented case has been an active 
area of research, and is relatively well-understood in the theory of quasi-random 
graphs and extremal combinatorics. 

In this paper, we consider the simplest cases of such a general question for 
oriented graphs, and provide some results on the global behavior of the orientation 
of D. For the case that H is an oriented four-cycle we prove: in every H-free 
oriented graph D, there is a pair A, B C V(D) such that e(A, B) > e(D) 2 /32\D\ 2 
and e(B, A) < e(A, B)/2. We give a random construction which shows that this 
bound on e(A,B) is best possible (up to the constant). In addition, we prove a 
similar result for the case H is an oriented six-cycle, and a more precise result 
in the case D is dense and H is arbitrary. We also consider the related extremal 
question in which no condition is put on the oriented graph D, and provide an 
answer that is best possible up to a multiplicative constant. Finally, we raise a 
number of related questions and conjectures. 



1 Introduction 

Many results in graph theory show that local properties of a graph have global conse- 
quences. An example, which is particularly relevant to us, comes from the theory of 
quasi-randomness [7] , and shows that if a large dense graph G does not contain a fixed 
graph H as a subgraph, then it must contain a large (linear-size) subgraph whose density 
is significantly different from that of the original graph. Thus, the restriction that G 
does not contain H as a subgraph forces G to exhibit irregular behaviour on a global 
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scale. In another direction, it has been conjectured by Erdos and Hajnal [TU] that for 
each graph H, there is a constant a > such that every graph G that does not contain 
an induced copy of H, must have a clique or independent set of size at least n a . A 
partial result, proving the existence of a clique or independent set of size exp(cv / logn), 
has been given [TU] (see also [5]). 

We prove results for oriented graphs which also show that large scale irregularities 
can be deduced from local properties. In this sense our results are quasi-randomness 
type results for oriented graphs. That said, they differ significantly in style from the 
work on quasi-randomness by Thomason [531 [H]; Chung, Graham and Wilson [7J; and 
Chung and Graham [B], which is of a more precise form and applies to dense graphs 
only. In fact, [6] is concerned with tournaments (orientations of complete graphs), and 
does indeed include a proof that excluding a fixed tournament as a subgraph (or even 
bounding the number of copies away from the expected value) does lead to consequences 
on a global scale (for example, vertex subsets X,Y with \e(X,Y) — e(Y,X)\ = r2(n 2 )). 
They actually prove much more - that eleven separate quasi-randomness conditions are 
equivalent. A more direct analogue of these results for general (dense) oriented graphs 
has been considered by the second author [T5] . 

Throughout the article we write D for an oriented graph, n for the number of vertices 
of D, and e(D) for the number of arcs (oriented edges) in D. For a pair of (non- 
necessarily disjoint) subsets A, B C V , we write e(A, B) for the number of arcs from A 
to B, i.e., the size of the set E(A,B) = {xy G E(D) : x € A,y G B}. We say that 
a subgraph E(A,B) is biased if e(B,A) < e(A,B)/2. (The choice of the fraction | is 
rather arbitrary, it could be replaced by any rj G (0, 1) without any significant effect on 
our results.) We write bias(D) for the size of the largest biased subgraph in D: 



The parameter bias(D) measures irregularities in the orientation of D. If bias(D) is 
small, then one might say that D has a random-like orientation. For example, if D is 
obtained by orienting edges at random, then bias(D) = 0(ri) with high probability (see 
Lemma l4~Tj) . In general, however, bias(D) may be as large as e(D), the number of arcs 
of D. We now state our main result. 

Theorem 1.1. There exists a constant s > such that every oriented graph D with 
bias(D) < ee(D) 2 /n 2 contains an oriented four-cycle. 

Remark. This result is best possible, up to the choice of e. In Section [4] we construct 
a family of oriented graphs D which have bias(D) < Ke(D) 2 /n 2 but do not contain 
oriented four-cycles, where K is a fixed constant. 

Theorem 1.2. There exists a constant e > such that every oriented graph D with 
bias(D) < ee{D) 2 /n 2 contains an oriented six-cycle. 

We have no reason to believe that this result is best possible. In fact we conjecture 
a stronger result. 

Conjecture 1.3. There exists a constant e > such that every oriented graph D with 
bias(D) < ee(T)f'l 2 jn contains an oriented six-cycle. 



bias(D) := max <^ e{A, B) : A,B <ZV such that e(B, A) < 
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We also have a more general conjecture concerning oriented cycles of all even lengths. 



Conjecture 1.4. For each k > 2, there exists a constant e > such that every oriented 
graph D with bias(D) < ee(D) k ^ k ~ 1 ' /n 2 ^ k ~^' contains an oriented cycle of length 2k. 

Remark. The analogous question for oriented cycles of odd length is not so interesting. 
Indeed, a random orientation D of the complete bipartite graph with parts of order 
|_n/2j and \n/2] has bias(D) very small (of order n) while D does not contain any 
oriented cycle of odd length. Looking for odd length cycles may become interesting in 
the case the underlying graph is not arbitrary, cf. Concluding Remarks (Section [7|). 

In addition to guaranteeing the presence of an oriented cycle of prescribed length, 
the above conditions can also be used to guarantee many such cycles. 

Theorem 1.5. There exist constants c, e > such that every oriented graph D with 
bias(D) < ee(D) 2 /n 2 contains at least ce{D) A /n A oriented four-cycles. 

Theorem 1.6. There exist constants c, e > such that every oriented graph D with 
bias(D) < ee(D) 2 /n 2 contains at least ce(D) 6 /n 6 oriented six-cycles. 

Remark. The number of cycles obtained is (up to the constant) best possible. A 
random orientation of the random graph G(n, e) (recall that G(n, e) is a graph selected 
uniformly at random from the class of all graphs on n vertices with e edges) will generally 
obey the bias condition (see Lemma |4.1[) . and have 0(e 4 /n 4 ) oriented four-cycles and 
0(e 6 /n 6 ) oriented six-cycles. 

The above results are relatively general in the sense that they apply in the case D 
is sparse as well as the case D is dense, while they arc restrictive in the sense that they 
only look for oriented four-cycles and oriented six-cycles. The next theorem does the 
opposite - restricting to the case D is dense, it counts copies of any oriented graph H. 

In what follows we count homomorphic copies of subgraphs. Let H be an oriented 
graph on k vertices. We write hom(H, D) for the number of homomorphic copies of H 
in D, i.e., the number of functions cf> : V(H) — > V(D) such that <j){x)<j){y) £ E(D) for 
every arc xy £ E(H). Likewise, we write hom(H,D) for the number of homomorphic 
copies of the unoricntcd graph H in D, i.e., the number of functions <fi : V(H) — > V(D) 
such that for every edge {x, y} £ E(H) either cf)(x)<fi(y) or (f>(y)<j){x) is an arc of D. 

Theorem 1.7. Let H be an oriented graph on k vertices and let D be an oriented graph 
with bias(D) < en 2 . Then 



Since one may easily bound the number of degenerate homomorphic copies of H , one 
easily deduces the following corollary. 

Corollary 1.8. Let H be an oriented graph on k vertices and let D be an oriented graph 
with bias(D) < en 2 . Lf hom(H,D) > 3 e ( H >en k and n > 4e~ 1 , then D contains H as a 
subgraph. 



hom(H 7 D) > 



hom(H, D) 

3e(H) 




k 
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Since the main results of the article may be viewed as lower bounds on bias(D) for 
iJ-free oriented graphs D (for given choices of H) 1 it is natural to also consider the 
related extremal question: 

What is the minimum value of bias(D) over oriented graphs on n (non- 
isolated) vertices? 

It will be seen from the results we obtain that this is indeed the most natural form 
of the extremal question. This question will be considered in detail in Section [51 The 
main results establish that the minimum value of bias(D) over oriented graphs on n 
non-isolated vertices is 0(n/logn). We also prove that the answer is larger, 0(n), in 
the case that the oriented graph is regular, or close to regular. In addition, we provide 
an algorithmic proof of the lower bound in the case D is regular. 

The layout of the article is as follows. In Section [2] we prove Theorem 11.11 and 
Theorem 11.51 In Section [3] we prove Theorem 11.21 and Theorem 11.61 In Section 2] we 
describe a family of oriented graphs which show that Theorem 11.11 is best possible (up 
to the choice of e). In Section [5] we consider the case of dense oriented graphs, and 
provide a proof of Theorem 11.71 In Section [6] we discuss extremal questions concerning 
the parameter bias(D). Finally, we give in Section [7] a number of concluding remarks. 
These remarks include new questions and conjectures, together with further discussion 
of the conjectures stated above. Wc should note here that it is possible to obtain 
another proof of Corollary 1 1 . 81 (with some different (worse) constants) using Szemeredi's 
regularity lemma. The details of this proof are included in the Appendix of the arxiv 
version of this article (arXiv:091 1.3969). 



We begin by defining some notation. Let a; be a vertex of an oriented graph D. We shall 
use the following notation: 



We let d + (x) = |r + (x)| and d (x) = \T (x)\. We also define the following notation for 
joint-degrees, we let d ++ (x,u) = \T + (x) n T+(u)\ and d + ~(x,u) = \T+(x) n T~(u)\. 

We now prove a useful lemma. Recall that throughout D denotes an oriented graph 
on n vertices with e arcs. 

Lemma 2.1. Let D be an oriented graph with bias(D) < e(D)/2. Then D contains at 
least e(D) 2 /8n paths of length two. 



2 Oriented Four- Cycles 



r+(aO 
r~(x) 
r++{x) 

T-(x) 



{yeV:xye E(D)}, 
{yeV:yi€ E(D)}, 
{y e V : 3z G Vxz,zy G E(D)}, 
{y G V :3z G Vyz,zx£ E(D)}. 
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Proof. The number of paths of length two in D is 

Denote by Z the set of vertices y for which d~(y) > 2d + (y). By summing over vertices 
y G Z one finds that e(V, Z) > 2e(Z, V), and so, by the definition of bias(D), we have 
that e(V, Z) < bias(D) < e/2. Therefore e(V, Y) > e(D)/2, where Y = V \ Z. We also 
have that d + {y) > d~(y)/2 for all y <S Y, so that 

E ^ ^ J E d_ (y) 2 > ^( E 



2^ v ' ~ 2n 

y ev s ey yer 

where the final inequality follows from an application of the Cauchy-Schwarz inequality. 
The proof is now complete, as J2 y eY d~(y) = e(V, Y) > e(D)/2. □ 

We now turn to the proof of Theorem ll.il We shall prove this theorem with e = 1 /32. 
Thus, throughout the proof we may assume that D has bias(D) < e(D) 2 /32n 2 . Note 
also the following useful formulation of the bias(D) < ee(D) 2 /n 2 property: 

e(B, A) > e(A, B) /2 whenever e(A, B) > ee(D) 2 /n 2 . (*) 

Also we introduce a final piece of notation. For each vertex x, we write e x for the number 
of paths of length two in D which start at x. Equivalcntly, e x = e(T + (x), T ++ (x)). 

Proof of Theorem \l.l[ Let D be an oriented graph with bias(D) < e(D) 2 /32?i 2 . By 
Lemma |2~T1 we know that there are at least e(D) 2 /8n paths of length two in D. Denote 
by W the set of vertices x with e x > e(D) 2 /16n 2 . Since at most e(D) 2 /16n paths of 
length two start at vertices outside of W, we have that ^2 xeW e x > e(D) 2 /16n. 

For each vertex x € W, we have e(T + (x),T ++ (x)) = e x > e(D) 2 /16n 2 and so, by 
(*), we have that e(T ++ (x),T + (x)) > e x /2. Equivalently 

E d ++ M>|. 

uer++(x) 

Summing over x G W , we obtain 

£ E , ++(l ,„)> S |>Sg!. 

We now consider a change in the order of summation. 

E E ^ ++ m>E E d++ M^ilr- 

In particular this implies that for some u € V one has Ylxer-- (u) d ++ (x, u) > e(D) 2 /32n 2 . 
Equivalcntly, e(T~~ (u),T + (u)) > e(D) 2 /32n 2 . A final application of (*) gives that 

e(T + (u),T—(u)) > e{D) 2 /QAn 2 > 0. 

This precisely gives us an oriented four-cycle. □ 
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In the above proof we use the property (*), together with the trick of changing the 
order of summation, to deduce the existence of a pair of vertices x, u between which 
there is a path of length two in each direction. This is not a rare occurrence, rather 
it is typical. Further, it is typical that the number of paths of length two in the two 
directions is of the same order. The following lemma proves this fact and allows us 
to deduce Theorem 11.51 Recall that g? h (x,u) denotes the number of paths of length 
two from x to u. Say that (x,u) is unbalanced if d^ (x,u) > I6d~* [u, x), otherwise 
it is balanced. A path of length two xy, yu is called unbalanced if {x, u) is unbalanced, 
otherwise it is balanced. 

Lemma 2.2. Let D be an oriented graph with bias(D) < ee(D) 2 /n 2 . Then the number 
of unbalanced paths of length two in D is at most 8ee(D) 2 /n. 

Proof of Theorem ] 1.5[ Let c = 1/4. 16 3 and e = 1/128. By Lemma I2TT1 there are at least 
e(D) 2 /8n paths of length two in D. By Lemma l2~2l at most 8ee(D) 2 /n < e(D) 2 /l6n of 
these paths are unbalanced. Therefore there are at least e(D) 2 /16n balanced paths in 
D. Let C x denote the number of oriented four-cycles containing the vertex x. 

C x = V d + -(x,u)d-+(x,u) > — V d + -(x,u) 2 . 
^— ' 16 ^-^ 

u£V u:(x,u) balanced 

Summing this quantity over x, and applying the Cauchy-Schwarz inequality, one obtains 
£^>J_ ^ ^-{x,uf>^{ £ d+-(x,u)) 2 . 

x x,u:(x,u) balanced (a^u) balanced 

This sum counts exactly the number of balanced paths of length two and so is at least 
e(Z?) 2 /16n. Thus, ^2 X C x > e(D) 4 /16 3 n 4 . The proof is now complete as this sum counts 
each oriented four-cycle four times. □ 

Proof of Lemma \2.'£\ Denote by / the number of unbalanced paths of length two, and 
suppose that / > 8ee(D) 2 /n. Let f x denote the number of unbalanced paths of length 
two starting at x, and let W denote the set of vertices x for which f x > f /2n. Since at 
most f/2 unbalanced paths of length two start at vertices outside of W, we have that 
at least f/2 unbalanced paths start inside W, i.e., J2 xew fx > f/2. For each vertex 
x £ W, we denote by U x the set of vertices u for which (x, u) is unbalanced, we have 
that e{T + (x),U x ) = f x > ee{D) 2 /n 2 . By (*) wc have that e(U x ,T+(x)) > f x /2, i.e., 

£«:(*,«) unbalanced d++{x,u) > f x /2. TllUS, 

E rf + +M>i]T/,>//4. 

(x ,u) unbalanced xGW 

Alternatively, denoting by X u the set of vertices x for which (x, u) is unbalanced, 

J2<Xu,r+(u))>f/4. 

u 

Let U denote those vertices u for which e{X Ul T + {u)) > f/Sn, and note that 

^e(I„r+W)>//8. 
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However, for each u <E U, we have by (*) that e(T + (u),X u ) > e(X u , T + (u))/2. And so 

J2e(r+(u),X u )>f/W. 

Reinterpreting this sum in terms of (x, u) and recalling that 

/= d + ~(x,u), 

(x,u) unbalanced 

we have that 

J2 d+-(u,x)>^ d+-(x,u). 

(x,u) unbalanced (x,u) unbalanced 

Thus, there exists an unbalanced pair (x,u) with g? h (u,x) > <i H (x,u)/16, a contra- 
diction. □ 



3 Oriented Six- Cycles 

In this section we prove Theorem II .61 Note that this immediately implies Theorem 1 1.2 1 
First, a lemma showing that there are many paths of length two ending at vertices with 
out-degree at least e(D)/8n. 

Lemma 3.1. Let D be an oriented graph with bias(D) < e(D)/8. Then D contains 
at least e(D) 2 /8n oriented paths of length two whose end point has out- degree at least 
e(D)/8n. 

Proof. We say that an arc xy is good if d~(x) > d + (x)/2 and d + (y) > d~(y)/2. It 
is very good if in addition d~ (y) > e(D)/4n. We first show that at least e(D)/2 arcs 
are very good. Let Z\ = {x : d + (x) > 2d~(x)} 7 Z 2 = {y : d~{y) > 2d + {y)} and 
Z 3 = {y : d~(y) < e(D)/4n}. An arc is very good unless its start point is in Z\ or its end 
point is in Z 2 UZ 3 . Since e(Z\,V) > 2e(V, Z x ) we have from (*) that e(Zi, V) < e(D)/8. 
Similarly e(V, Z 2 ) < e(D)/8. Trivially e(V,Z 3 ) < e(D)/4. Hence, at least e(D)/2 arcs 
are very good. For each vertex x € V \ Z\, let d+ g (x) denote the number of very good 
arcs with start point x. Thus, the number of paths of length two the second arc of which 
is very good, is at least 



J2 d-{x)dt g {x)>\ W>U E <(-)) 2 > 



2 P in\ 2 



ejDY 

xev\Zi xev\Zx xev\Zx 

where the second inequality follows from the Cauchy-Schwarz inequality. The proof 
is now complete, as a path of length two whose second arc xy is very good has that 
d + (y) > d~ (y) 12 > e(D) /8n. □ 

For our proof of Theorem 11.61 we shall need some new notation. This notation will 
allow us to prove the existence of the constants c, e > without explicitly calculating 
them. Specifically, 7(e) denotes any decreasing function of e which is positive for all 



7 



sufficiently small e > 0. While 5(e) denotes any increasing function of e which has limit 
as e —> 0. It should be understood that 7(e) and 5(e) are not functions, but rather 
classes of functions, in the sense that the commonly used O(-) notation represents a class 
of functions rather than an individual function. To give some examples to clarify this 
notation, we note that for two arbitrary constants c\,C2 > 0, the function — c\e + C2 is 
(of class) 7(e), while — c\e is not 7(e) since for all non-negative (even sufficiently small) 
values of e, — c\e is negative. In the same way, for two arbitrary constants c\,C2 > 0, 
the function c\e is (of class) 5(e) while the function c\e + C2 is not 5(e). It should be 
now clear that 7(e) — 5(e) = 7(e) and 7(e) jC = 7(e) for any constant C, and also that 
7(e)7(e) = 7(e). 

In this terminology, to prove Theorem 11.61 it suffices to show that every oriented 
graph D with bias(D) < ee(D) 2 /n 2 contains at least j(e)e(D) 6 /n 6 oriented six-cycles. 

Let D be such that bias(D) < ee(D) 2 /n 2 . We state some consequences of previous 
results in our new notation. Lemma [3.11 tells us that D contains at least r )(e)e(D) 2 jn 
paths of length two whose end point has out-degree at least j(e)e(D)/n. While Lemma 
12.21 tells us that D contains at most 5(e)e(D) 2 jn unbalanced paths of length two. To- 
gether, these results imply that D contains r )(e)e(D) 2 /n paths of length two which are 
balanced and which have end points of out-degree at least j(e)e(D)/n. 

We are now in position to start our proof. 

Proof of Theorem \1.6[ Let D be an oriented graph with bias(D) < ee(D) 2 /n 2 . Say 
that a path of length two which is balanced and whose end point has out-degree at least 
j(e)e(D)/n is a great path. We found above that D contains at least -f(e)e(D) 2 /n great 
paths. We now form a weighted directed graph H on the same vertex set V as follows. 
Start with the complete directed graph (i.e., the directed graph with arcs xy (and yx) 
for all x y), weight each arc xu by w x - u the number of great paths from x to u, and 
delete arcs of weight zero. By the above result, the total weight of arcs in H is at least 
j(e)e(D) 2 /n. Now divide the set of arcs of H into two categories of heavy and light arcs: 
a heavy arc is an arc of weight at least e(D)/n while a light arc has weight less than 
e(D)/n. Since the total weight in H is at least j(e)e(D) 2 /n, either the total weight of 
heavy arcs or the total weight of the light arcs is at least r y(e)e(D) 2 /n. We shall refer 
to these as Case I and Case II respectively, and denote by H' the subgraph consisting 
of arcs of appropriate weight (heavy arcs of weight > e(D)/n in Case I and light arcs 
of weight < e(D)/n in Case II), although for the time being the proof continues for the 
two cases in parallel. 

For a weighted oriented graph and a vertex v 7 the out-weight (resp. in-weight) of 
v is the total weight of the out-going (resp. in-coming) arcs of v. Let H" be the 
subgraph of H' consisting of all the arcs whose start vertex has out-weight at least 
r )(e)e(D) 2 /n 2 . Since to total weight in H' is at least j(e)e(D) 2 /n, a simple averaging 
argument shows that there is a function 7(e) such that the total weight of arcs of H" is 
at least j(e)e(D) 2 jn. For each vertex x with positive out-weight in H", we denote by 
e x its out-weight; in particular, note that by the definition of H", e x > j(e)e(D) 2 /n 2 . 

Our proof will be based on a local argument concerning the distribution of weights 



8 



of H" around each vertex of positive out-weight. To simplify the presentation, fix such 
a vertex x and define a new (induced) weighting oj x , or simply uj if there is no risk of 
confusion, on the vertices by setting lo u equal to the weight of the arc xu in H" (if xu 
is not an arc of H", simply define uj u = 0). Define the weight of a path of length two 
vaj, yv in D to be lo u uj v . We note again that the weighting uj depends on the choice of x 
and is simply induced from the weighting of out-going arcs from x. 

Claim. Under this weighting of the vertices, the total weight of paths of length two 
in D is at least j(e)e 2 e(D) 2 /n 3 . 

Before presenting the proof of this Claim, and to keep the continuity of the proof, 
we show how the Claim allows to finish the proof of Theorem 11.61 We will deduce from 
the Claim that there are at least r y(e)e(D) 6 /n 6 oriented six-cycles in D. We recall that, 
for a fixed vertex x and in terms of the original weighting of the arcs of H", the Claim 
is discussing the total weight of paths of length two in D where the weight of the path 
uy, yv is defined to be w x - u w x - v (this now being the weight in H"; recall that w u = w x -^). 
Recall also that, by the definition of the weighting of the arcs, if w X v > 0, then there are 
w x - v great paths of length two from x to v in D, and in particular this tells us that (x, v) is 
balanced, and so that in addition there are at least "f(e)w x - v paths of length two from v to 
x. This in turn implies that each path of length two vy, yv allows us to find "f^w^uix-y 
oriented six-cycles containing x (simply combine each of the w x - u paths of length two from 
x to u with the path uy, yv followed by each of the j(£)w x -v paths of length two from v to 
x). Thus, writing C x for the number of oriented six-cycles in D which contain x and using 
the Claim, we have that C x > , y(e)e 2 e(D) 2 / 'n 3 . Summing C x over all vertices x with 
positive out-weight in H", we obtain that J2 X C X > 7(e) J2 X e 2 e(D) 2 /n 3 . By Cauchy- 
Scwartz, this is at least 7(e)(Xix e x) 2 e(D) 2 /n 4 , and so, since the sum expresses the total 
weight of the arcs of H", ^2 X C x > r y(e)e(D) 6 /n 6 . This sum counts each oriented six- 
cycle at most six times, so the number of oriented six-cycles in D is 7(e)e(-D) 6 /n 6 , and 
Theorem 11.61 follows. 

Thus, we are only left to prove the above Claim. 

Proof of Claim. We recall that the weighting of vertices (induced by the fixed vertex 
x of positive out- weight) is simply defined in such a way that the weight oj u of a vertex 
u is the weight w x ~ u of the arc xu. We will divide the proof into two parts, depending 
on whether we are in Case I or in Case II. 

(Recall that in Case I, the subgraph H' consists of all the heavy arcs, i.e., all the arcs of 
weight > j(e)e(D)/n, while in Case II, H' is the subgraph containing all the light arcs.) 

The proof is simpler if we are in Case I. In this case, every vertex with positive weight 
has weight at least e(D)/n, so that every path of length two with positive weight has 
weight at least e(D) 2 /n 2 . It thus suffices to find at least j(e)e 2 /n paths of length two 
of positive weight. We denote by U the set of vertices of positive weight, i.e., 

U :={ueV(D)\lu u >0}. 

Since x has positive out-weight in H", by the definition of H", it has out- weight 
e x > -f(e)e(D) 2 /n 2 . This implies that e(T + (x),U) = e x > -y(e)e(D) 2 /n 2 and so, triv- 
ially, e(V,U) > e x > r )(e)e{D) 2 /n 2 . Let now Z be the set of all vertices which are 
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"unbalanced" with respect to U, namely, let 

Z:={y | e({y},U)>2e(U,{y})}. 

Our condition on bias(D) implies that e(Z, U) < ee(D) 2 /n 2 ]. We infer that e(Y, U) > 
j(e)e x , where Y — V \ Z, and so the number of paths of length two from U to U is at 
least 

]T e({y}, U)e(U, {y}) > J ]T e (M' U ? ^ T- ( E e (M' Uj) 2 ^ n . 



2 ^ VL " J ' ' ~ 2n 

Which completes the proof in Case I. 

For Case II, we divide the vertex set according to weights. For each i > 1, define 

-e(D) e(D) . i 



Vi := { « | w„ 6 [■ 



2*n ' 2 1 - 1 ) 



Since we are in Case II, all the weights uj u are less than e(D)/n and thus, V+ = \_\ i>1 Vi, 
where V+ is the set of all the vertices of positive weight. 

The idea in this case is roughly speaking as follows (to make this idea work, we have 
to restrict the subset of indices to some subset I, see below): To prove the claim, that 
the total weight of paths of length two in D with respect to the weighting uj is at least 
r y(e)e 2 e(D) 2 /n 3 , we will only consider a subset of all the paths of length two from V+ to 
V+ , and decompose the set of all such paths into different sorts depending on the starting 
point of the path being in Vi and the end point of the path being in Vj, for i,j G I (for 
a subset I of the index set that we will define below). Let e^iVi, Vj) be the number of 
paths of length two from Vi to Vj . By the definition of Vi , all the weights oj u are "almost" 
uniform for u G Vi, namely, Tjn^ < w„ < ^t=rz ■ It follows that the total contribution 
of the paths of length two from Vi to Vj, for i,j £ I, is &(^J2i jei 2*+i > - 2 n 2 e2 ' ^J')) ' 

and thus, we only need to show that J2i jei 2^-' i ri 1 e 2(^i Vj) ^ 7( £ ) e 2- e (^) 2 / n3 - This 
is exactly what we will do in the following. 

For each i, let Sj = \Vi\. Since by our assumption, x has positive out- weight in H" , 
it must have out-weight e x > r y(e)e(D) 2 /n 2 . This implies that 

e(r+(x),V+) = e x > 7 ( £ )e(Z?) 2 /n 2 . 

Given that for each i, oj u is at most ^-^n ' a simple summation shows that at most 
ce(D) 2 /n 2 of these arcs go to sets Vi for which < ce(D)/n, this for any constant 
c > 0. We infer, by the definition of the class j{e), that for some sufficiently small 
constant c, we have 

e(r+(^,F J )> 7 (e)e :cj 



where 7 = ji | a* > ce(D)/n| and Vi = Uiei 
Since 2 Si ^H > e(r+(x), Vi), it follows that 



>l{e)e x - (1) 



^ 2 i " 1 n 
iei 
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For every vertex u £ V+, the arc xu has positive weight in H" C H, and by the 
definition of H , we know that u must have out-degree at least j(e)e(D) /n in D. Therefor, 
for each % £ I, one has 

e(^,y)= ^Td + (u) > |F i | 7 (e)e( J D)/n > cry(e)e(D) 2 /n 2 = j(s)e(D) 2 /n 2 . 

ueVi 

Define li to be the set of "balanced" vertices with respect to Vj, namely, 

Y i ={y\e(V i ,{y})<2e({y},V i )}. 

Using the definition of bias, we may bound by ee{D) 2 / n 2 the number of arcs going to 
vertices y £" Yi, deducing that 

e(Yi,Yi) > "f(e) Si e(D)/n (> j(e)e(D) 2 /n 2 ). (2) 

Consider the weighted bipartite simple graph F which has parts / and V, has 
an edge iy whenever y £ Yi, and has a weight rji y on the edge iy given by r\i y = 
min{ e({y}, Vi) , e(Vi,{y}) }. Now for each walk of length two iy, yj in F, we know 
there are at least r]i y r]jy paths of length two from Vi to Vj in D passing through y (there 
paths are constructed by e(Vi, {y}) > i]i V from Vi to y and e({y}, Vj) > 77^ arcs from y 
to V,). It follows that e 2 (Vi,Vj) > J2 y ev ViyVjy ■ 

Thus, to finish the proof of the claim, given that the weight lo u of a vertex u in Vi 
is in the interval , JrSr)) and that the total weight of paths of length two from 

Uiei Vi t0 Uiei V. is ©( Z)i,j e j 2'+^^ eaO^Vj)) > i* suffices to prove that 

E E a^W^ " ^) e * e ^)> 3 - (3) 

Define the weight of a walk iy,yj in F to be e(D) 2 r]i y r]jy/2 1+: >~ 2 n 2 . so that we will 
have to show that the total weight of walks of length two in F is at least -f(e)e 2 e(D) 2 /n 3 . 
This follows immediately from the more general result (itself a trivial consequence of the 
Cauchy-Schwarz inequality) that for any graph with weighted vertices and edges, the 
total weight of walks of length two (where the walk uy, yv is assigned weight rjuVuyVyvVv) 
is at least W 2 /n, where W denotes J2 u VuJ2 y Vuy In our case, in the bipartite graph 
F, the edge weights are rji y for i £ / and y £ Yi, and the vertex weights are defined by 
r]i = 2i D i n f° r * <= I an( i f](y) = ior y <E V. We have 



(by the definition of rji y ) 

(by Inequality ©) 
(by Inequality 



W = E^E^ 

> 7 (e)e(J) y ^eQD) 
~ n ^ 2 i ~ 1 n ' 

> j(e)e x e{D)/n 
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The left term of © is at least W 2 /n > ~f(s) 2 e 2 x e{D) 2 /n 3 = ~f{e)e 2 x e(D) 2 /n 3 and In- 
equality Q follows. This completes the proof of the Claim (and so the proof of the 
theorem) . □ 



4 Examples 

To prove that a result such as Theorem ll.ll is best possible up to the choice of constant, it 
would not suffice to produce just one oriented graph which docs not contain an oriented 
four-cycle and has bias(D) = Ke(D) 2 /n 2 , for some constant K. Nor would it suffice 
to produce a class of examples that were all of the same density. The theorem applies 
across a large range of densities, so one must produce examples across a large range of 
densities. We provide in this section a wide class of examples of oriented graphs which 
do not contain oriented four-cycles and which have bias(D) < Ke(D) 2 /n 2 , where K is 
some fixed constant. 

Our initial examples are obtained as random orientations of four-cycle free simple 
graphs. These will have approximately n 3 / 2 arcs. We will then obtain more dense 
examples as blow ups of these initial examples. Before we do this, we first prove a 
lemma concerning the value of bias(D) (and certain variants) for randomly oriented 
graphs, prove a lemma concerning the value of bias(D) when D is blow up of some other 
oriented graph, and recall a result concerning four-cycle free simple graphs. 

We define a more general concept of bias. For 7 € (0, 1), we say that a subgraph 
E(A,B) is 7-biased if e(B, A) < r ye(A,B), and we write bias 1 {D) for the size of the 
largest 7-biased subgraph of D, so that, 

bias 7 (D) = m&x{e(A,B) : A,B C V with e(B, A) < je(A, B)} . 

Note that bias(D) is of course bias\/2{D). 

Lemma 4.1. Given 7 G (0, 1), there exists K 7 € R such that for every simple graph G 
on n vertices, there exists an oriented graph D obtained by orienting the edges of G with 
bias 7 (D) < K^n. Furthermore, a random orientation D of G has bias-y(D) < K 7 n with 
high probability. 

Proof. For a pair A,BC V{G) we write cab for the number of edges between A and 
B in the graph G. Let D be obtained from G by orienting its edges at random. From 
Chcrnoff's inequality [5] (see Lemma l6.6|) we have 

F(e(B, A) < je(A, B)) < exp(-c(7)e AB ) < cxp(-c( 7 )A» , 

where 0(7) is a positive constant dependent on 7. We set K 1 = 2/c(-f). We shall prove 
that W'{bias 1 {D) > K^n) < 1.8 _n . This proves the lemma. The event bias^D) > KjU 
can occur only if there is a pair A,BC V(G) with ea(A, B) > K^n for which e(B, A) < 
7e(A, B). There are at most 4™ such pairs (A, B) and for each such pair the probability 
that e(B,A) < r ye{A,B) is at most exp(— c(j)cab) < cxp(— c(7)A' 7 n) = exp(— 2n). 
Thus, by the union bound, F(bias 1 (D) > K^n) < 4™exp(-2n) < 1.8~ n . □ 
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If an oriented graph D' contains no large biased subgraphs, then this property carries 
over, in a weakened form, to a blow-up D of D' . A blow-up of an oriented graph D' is 
defined as follows. 

Definition. Let D' be an oriented graph on {1, . . . , m} and let I S N. The l-blow-up 
of D' is the oriented graph D with vertex set V = V\ U • • • U V m , where the sets Vi are 
disjoint and each of cardinality I, and with arc set E{D) = Uij^E(D')B{Vi, Vj), where 
B(Vi,Vj) represents the complete bipartite oriented graph on Vi U Vj with all arcs going 
from Vi to Vj. Each Vi is called a cell of the blow-up. 

The key result we need about blow-ups is, 
Lemma 4.2. If biaso.g(D') < f and D is an l-blow-up of D' , then bias(D) < 16fl 2 . 

Proof. Let D be an Z-blow-up of D 1 . We suppose that bias(D) > 16/7 2 and use this to 
show that biaso.g(D') > f. Our assumption gives us that there exist sets A,B C V(D) 
with e(A, B) > 16 / 'I 2 and e(B, A) < e(A, B) /2. We use this irregularity between A and 
B, this bias in the direction from A to B, to find subsets I,JQ {1, . . . , m} such that in 
D' we have e £>'(/, J) > f and en'(J, I) < 0.9e(J, J), this will prove biasQ.§{D') > f and 
so will complete the proof. 

As a warm-up, we first consider the easier case when A and B are both unions of 
cells of the blow-up. In this case A = UiVi and B = UjVj. Which gives 

e(A,B) 16/Z 2 
e(/, J) - j- 2 > —p- = 16/ > / , 

while 

e(g,A) e(B,A) e(J, J) 
e (J, / ) = _^<^^ = ^_. 

Now for the general case, if there is a relative deficiency in the number of arcs from 
B to A, we identify the vertices of A responsible for this. Let 

A' = {x€A:e({x},B)>^e(B,{x})]. 
Note that e(B, A \ A') > 2e(A\A\B)/3 so that 

e(A, B) > 2e(B, A) > 2e(B, A\A')> -e(A \ A', B) . 

3 

Thus, e(A \A',B) < 3e(A, B) /4, and so e(A', B) > e(A, B) /4 > 4// 2 . Now, let I = {% ; 
ViC\ A' 7^ <j)} and let A" = UjVi. By the homogeneity of parts of the blow-up, we have 
for all x G A" that 

e({x},B)>^e(B,{x}). 
Also e(A",B) > e(A',B) > 4fl 2 . We now begin a similar procedure to find B" . We let 

B' = 10 



>={ v eB:e(A",{y})>-e({y} > A")} 
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This implies that e(B \ B' , A") > 9e(A", B \ B')/1Q, so that 

3 ^ 97 

e(A", B) > -e(B, A") > -e(B \ B',A") > -e(A" , B\B>). 

Thus, e(A", B\B') < 20e(A", B)/27, and so 

e(A",B') > ^e(A",B) > \e{A",B) > fl 2 . 

Let J = {j : Vj fl B' ^ <fi}, and set B" = UjVj. With a similar argument to that given 
previously we obtain that e(A",B") > e(A",B') > fl 2 and 

e(B",A")<^e(A",B"). 

This tells us that in D' we have e(I, J) > fl 2 /l 2 = /, while 

e(gV4") 9eQ4",g") 9e(J,J) 

□ 



The final piece of information we need before stating our examples concerns the 
existence of large four-cycle free simple graphs. Let q be a prime power. The Erdds- 
Rcnyi graph G [12] has V(G) being the set of points of the finite projective plane 
PG(2, q) over the field of order q (so that n = q 2 + q + 1), and an edge between (x, y, z) 
and (x', y', z 1 ) if and only if xx' + yy' + zz' = 0. In fact this implies e(G) = \q{q + l) 2 
and this graph does not contain a four-cycle. So for all n of the form q 2 + q + 1 (where 
q is a prime power), there is a graph G on n vertices with at least ^n 3 / 2 edges which 
does not contain a four-cycle. We would like a result which holds for all n. We recall 
that Bertrand's Postulate states that for all k > 2, there is a prime between k and 2k, 
combining this with the above example one may deduce the following. 

Lemma 4.3. Given n > 2, there exists a graph G on n vertices with at least ^r?l 2 
edges which does not contain a four cycle. 

□ 



We may now state examples of oriented graphs with bias(D) < Ke 2 /n 2 which do not 
contain an oriented four-cycle. Our first examples are obtained by considering random 
orientations of four-cycle free simple graphs. By the above lemma, there exists, for each 
7i, a simple graph G on n vertices which is four-cycle free and has e(G) > n 3 ' 2 /20. 
Let D be obtained by orienting the edges of G at random. Then D certainly cannot 
contain an oriented four-cycle and, by Lemma |4. 11 with positive probability bias(D) < 
Ki/in < 400Ki/ 2 e(D) 2 /n 2 . In particular this gives us for all n an oriented graph 
on n vertices which does not contain an oriented four-cycle and for which bias(D) < 
400K 1/2 e{D) 2 /n 2 . 

Our more general class of examples is obtained by considering blow-ups of the above 
examples. For a fixed constant K (in fact we take K = 6400-Ko.9), we define for each 
pair of natural numbers m and / an oriented graph D m j which contains no oriented 



14 



four-cycle and has bias(D) < Ke{D) 2 /n 2 . The number of vertices of D will be n = ml, 
while e(-D), the number of arcs of D, will be of the order m 3 / 2 / 2 . 

Fix a pair of natural numbers m and I. Let G be a four-cycle free simple graph on m 
vertices with at least m 3/,2 /20 edges. Let D' be an oriented graph obtained by orienting 
G and such that biaso.g(D') < Kq^tti, the existence of such an oriented graph being 
assured by Lemma 14.11 Let D = D m j, be obtained as an Z-blow-up of D'. We now 
have, by Lemma \A. 21 that bias(D) < IQKq qttiI 2 . It is easily observed that n = ml and 
e > m 3 / 2 / 2 /20, so that ml 2 < 400e(D) 2 /n 2 . Thus, bias(D) < 6400K . 9 e(D) 2 /n 2 , and, 
by inspection, D does not contain any oriented four-cycle. 

As a demonstration of the generality of our class (-Dm,i)m,leN of examples, note that 
for any pair rio,eo with eo > n,Q , there is a choice of m and I such that D m i has 
approximately no vertices and approximately eo arcs. Simply choose m to be an integer 
close to ng/400e§, choose a four-cycle free graph G with close to m 3 / 2 /20 edges, and 
choose I to be close to 400e 2 l /n 3 ) . 



5 The case D is dense 

In this section we prove Theorem 11.71 In fact we shall prove a more general result, 
Proposition 15. 11 which also counts homomorphic copies of partially oriented graphs. A 
partially oriented graph if is a graph which may have some of its edges oriented. We 
write e(H) for the number of edges of H that are oriented, e.g. if H is a simple graph 
then e(H) = and if H is an oriented graph then e(H) = e(H). We also introduce the 
notation e(A, B) for the total number of edges (whatever their orientation) between A 
and B. Note that if D is an oriented graph with bias(D) < en 2 , then in particular the 
following holds in D: 

e(B, A) > e ^ B "> _ i n 2 for all A, B C V . (4) 
3 3 

We now turn to Proposition 15. II This proposition clearly implies Theorem 11.71 

Proposition 5.1. Let D be an oriented graph on n vertices satisfying Let H be a 
partially oriented graph on k vertices. Then 

hom(H,D)>^^- { l-3-^)lnK 

Remark. To prove that a dense oriented graph with bias at most en 2 contains H as a 
subgraph, one may use Szcmcrcdi Regularity Lemma (cf proof in Appendix of the arxiv 
version of this article (arXiv:0911.3969)). However we believe that the use of Proposition 
15.11 with the following direct proof is simpler and illustrates better the use of the bias 
parameter. Furthermore it gives better constants. 

Proof. We prove the proposition by induction on e(H). If e(H) = 0, then H = H, 
and so hom(H,D) = hom(H,D). For the general case, let H be an oriented graph 
on {!,..., k} with e(H) > 1. By relabelling if necessary (which does not affect the 
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homomorphism count) we may assume that 12 is an arc (oriented edge) of H. Let 
H' be the partially oriented graph obtained by unoricnting this edge. We now re- 
late the quantities hom(H\D) and hom(H,D). For each (2:3, . . . , x k ) € V k ~ 2 , let 
Hom(H', D; . , . , 23, . . . , x k ) denote the set of homomorphisms of H' into D for which 
4>(i) = Xi for all i = 3, . . . , k. Similarly define Hom(H, D; . , . , 23, . . . , Xk)- In fact, it 
is easy to characterise the homomorphisms £ Hom(H' , D; . , . , 23, . . . , Xk)- A homo- 
morphism G Hom{H' ', D; . , . , 23, . . . , Xk) must have <f>{i) = Xi for i = 3, . . . , k, and 
must pick values for 0(1) and 0(2). Writing x\ for 0(1), we know x\ must join up 
appropriately to the vertices 23 , . . . , Xk ■ Specifically 

(i) x\Xi is an arc of D, for every arc li : i > 3 in H. 

(m) XjX\ is an arc of D, for every arc il : i > 3 in H . 

(iii) x\Xi is an edge of D, for every edge li : i > 3 in If. 



Equivalent^, 21 £ rii>3:i1e.E(ff) T fa) n ni>3:ile.E(ff) r+ (' T ») n r\i>s:iieE(B) r (^)- 
We denote this set A. Similarly, writing 22 for 0(2), there are similar restrictions on 
22) which again are equivalent to demanding that 22 belongs to a certain set, we denote 
this set B. Since H' has an unoriented edge between 1 and 2, we have a final condition 
- the condition that 2122 is an edge of D. Hence for certain sets A and B, we have 
a one-to-one correspondence between homomorphisms £ Hom(H' , D; . , . , 23, . . . , 2&) 
and edges of Z) between ^4 and B. 

Similarly, we may characterise the homomorphisms £ Hom(H, £>;.,., 23, . . . , 2&). 
Again we write x\ and X2 for 0(1) and 0(2). The restrictions x\ £ A and 22 £ -B 
remain. However, on this occasion we require not only that there is some edge between 
x\ and X2, but that there is an oriented edge from xi to 22. Thus, there is a one-to- 
one correspondence between homomorphisms € Hom(H, D; . , . , 23, . . . , 2fc) and edges 
from A to B. 

Thus, \Hom(H / , D; ., . , £3, . . . , 2fc)| and \Hom(H, D; ., . , 23, . . . , x k )\ are e(A, B) 
and e(A, i?) respectively, for some pair of subsets A, B C V . From our condition we 
obtain that 

\Hom(H, D;.,.,x 3 ,...,x k )\> \ H ™&'> D > -^1112^ - ^ . 

Since hom(H, D) is the sum over (23, . . . , x k ) £ V k ~ 2 of \Hom(H, D; . , . , 23, . . . , 2fc)|, 
and similarly hom(H' , _D), we have that 

W(g,^)> fe ° m( f' J) ~V. 

Having obtained this relation between hom(H,D) and hom(H' , D), we require only an 
application of the induction hypothesis. As e(H') = e(H) — 1, an application of the 
induction hypothesis to H' gives hom(H',D) > hom(H, Z^/S^" 1 - (1 - 3 1-S W)§7i fe . 
Combining this with the inequality proved above 



hom(H,D) 1^ Ql -g (g) \e fc e^ k _ hom(H,D) g (H) .t fc 



hom{H,D) > l (m '- - (l^-^> - n *--n* = J, ' ' -(1-3-^) 



3 e ~W 3 V 7 2 3 3 e ~W 

□ 
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6 The extremal problem concerning bias(D) 



The main focus of the present article is the question of what structural information on 
an oriented graph D can be obtained from the knowledge that a certain oriented graph 
H is not a subgraph of D. More precisely, our results relate to the question: 

What is the minimum value of bias(D) over H- free oriented graphs D with 
n vertices and e arcs? 

In this section we consider the related extremal question: 

What is the minimum value of bias(D) over oriented graphs on n (non- 
isolated) vertices? 

It will be seen from the results we obtain that this is indeed the most natural form of 
the extremal question. 

The definition of bias(D) contains the constant |. As commented in the introduction, 
this choice is rather arbitrary, and all our results hold (up to a change of constant) if ^ is 
replaced by some other constant -q G (0, 1). The same is true for the results of this section, 
but since it is not trivial to deduce the bounds on bias^ from those for bias = biasi^, 
we shall prove our bounds for bias v , rj <E (0, 1). We shall state our lower bounds as lower 
bounds on the quantity ow(D) := max{e(A, B) : A,B C V(D), e(B,A) = 0}, the size 
of the largest one-way subgraph in D. Since, trivially, bias n (D) > ow{D), this provides 
a lower bound on bias n (D), rj € (0, 1). Our results are as follows. 

Theorem 6.1. Every oriented graph D on n (non-isolated) vertices has ow(D) > 
n/9[log 2 n\. 

Remark. This result is not difficult to prove, and we remark that it may be easily 
deduced from Lemma 2 of Q3] (in fact the tight version of that lemma, whose proof is 
sketched after the proof of Lemma 2). 

Theorem 6.2. For each 7/ G (0, 1) there is a constant K n such that, for every n, there 
is an oriented graph D on n (non-isolated) vertices with bias n (D) < K n nj log 2 n. 

Remark. This result is proved by considering a random orientation of an inhomoge- 
neous random graph. The resulting graphs typically contain 0(n 2 /(log n) 2 ) edges. Vari- 
ants of this example could be produced with 6(n Q ) edges, for any a £ (1,2) (although 
the constant would be dependent on a). 

One surprising facet of our results is that a much larger one-way subgraph can be 
found in the case that D is out- regular (all out-degrees equal) , irrespective of the degree: 
the lower bound n/4 follows immediately from the following proposition. 

Proposition 6.3. Let D be an oriented graph with maximum out-degree A + ; then 
ow{D) > e(D)/4A+. 

For oriented graphs that are close to out-regular, in the sense that A+ is at most 
a constant multiple of the average out-degree, the above bound is of the same order as 
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the upper bound fLcmma l4.1j) on bias^D) for D whose orientation is random. Simple 
applications of Chcrnoff bounds [5] (see Lemma l6.6[) imply that a random orientation of 
the Erdos-Rcnyi random graph G(n,p) (with p = w(log n/n)) will be close to out-regular 
with high probability. From which we may deduce. 

Corollary 6.4. Let D be a random orientation of the Erdos-Renyi random graph 
G(n,p), with p = uj(logn/n). Then ow(D) = Q(n) with high probability. 

We also note the following cute lower bound, which follows from Lemma 16731 and the 
trivial lower bound ow(D) > A + . 

Proposition 6.5. Let D be an oriented graph, then ow(D) > y/e{D)/2. 

Since there are many results in this section, we shall split into subsections. In 
subsection 16.11 we prove the lower bounds on ow(D), Theorem 16.11 and Proposition 16.31 
In subsection 16 . 21 we prove Thcorcm l6.2l We finish the section with an algorithmic proof 
of the lower bound ow(D) > n/A for regular oriented graphs D. 

We also draw here the reader's attention to the article of Brown, Erdos and Si- 
monovits [3], which considers extremal problems for directed graphs, where parallel arcs 
in opposite directions are allowed. 

Let us state here, the form of Chcrnoff's Inequality [5] that we shall use throughout 
the section. 

Lemma 6.6 (Chcrnoff bound for the sum of Poisson trials). Let ft S [0,1] and let 
Sn = X\ + ... + Xn be a sum of independent random variables Xi taking values in 
{0,1}, with ¥(Xi = 1) = pi for each i = l,...,N, so that the expectation of Sn is 
fi = J^iPi- Then, for all A > fx, 

P(^>(l+/?)A)<cxp(^). 
In the other direction, for all A < fx, 

nS N < (1 - /3)A) < exp (^) . 

The usual statement of Chernoff 's inequality is the A = /i case of the above statement, 
a proof of which is given in |19) . The more general statement given here may be deduced 
from this by an easy monotonicity argument. 



6.1 Lower bounds on ow(D) 

Throughout the section we shall write B{A) for the set {v € V : e({v},A) = 0}. Since 
B(A) maximises e(A, B) over the set of all sets B with e(B, A) = we have that 

ow(D) = max e(A,B(A)). 

ACV(D) 

All we shall need for the following proofs is that, for any probability distribution on 
subsets A of V(D), we have ow(D) > E(e(A, B(A))). 
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Proof of Proposition \6.S\ Let D be an oriented graph with maximum out-degree A + . 
Let A C V be a subset selected at random, each vertex included in A independently 
with probability p. To prove the proposition it suffices to show that, when p is chosen 
appropriately, 

E(e(A,B(A)))>^. 

Expressing e(A, B(A)) as Y^ixyeE ^-xeA,yeB(A)t exchanging the order of summation and 
expectation, and using that E(l^) = P(F) for any event F, we obtain 

E(e(A, B(A))) = J2 V ( x e A > V G B{A)) . 

xy£E 

Fix an arc xy £ E{D). The event y £ B(A) is exactly the event f]veT+(y)( v A )- Thus, 
this event is independent of the event x £ A, and the probability that it occurs is at 
least (1 - > (1 - p) A+{D ^ > 1 - pA+(D). From which we deduce, 

E(e(A, B(A))) > e(D)p(l - P A+). 

The proof is now completed by taking p = 1/2A+. □ 

In the case that D is a regular oriented graph, the above proof shows that E(e(A, B(A))) > 
n/4 where A is selected by including each vertex in A independently with probability 
p = l/2d (where d is the regular in- and out-degree). In the general case, there may be a 
large range of degrees in D, and so there is no obvious choice of p. To prove Theorem l6.ll 
we first find a patch of vertices with approximately the same degree, and then proceed 
as above. 

Proof of Theorem \6.1[ Define subsets V + , V~ of the vertex set V by setting 

V + = {v: d + (v) > d~(v)} and V~ = {v : d~(v) > d + (v)} . 
Partition V + into , . . . V^ og n -| by defining 

V+ = [v£V+ : d+(v) £ [n/T^/T- 1 ) } . 

Similarly partition V~ into , . . . , V^ og . Since these 2[log 2 n\ sets cover V, one of 
them must have size at least n/2[log 2 n~\ . By reversing the orientation of D if necessary 
(which has no effect on ow(D)), we may assume that \V^\ > n/2[log 2 n] for some i. 
Having chosen this patch of vertices V~, we define p = 2 l /3n. We define the random 
subset A of V by including each vertex in A independently with probability p. Let 
B = B(A) n V~ . Arguing as in the proof of Proposition 16.31 above, we have, for each 
y £ V[~ and each arc xy of D, that the events x £ A, y £ B are independent and that 
P(y £ B) = {l-p) d+ ^ > (l~p) d ~^ > l-pd~(y). Thus, 

E(e(A,B))=J2 V(?eA,yeB)> ^ pd~ (y)(l - pd~ (y)) . 

v evr xy£E(D) yev t r 

By the choice of p we have that pd~{y) £ [|, |]. Since x(l — x) > | for all x £ [3, |], 
we deduce that 

E(e(A,B))> 2 J}n>-^. 

9 9|log 2 ri| 

□ 
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6.2 Proof of Theorem 16.21 : An oriented graph with no large rj- 
biased subgraph 

Fix r\ £ (0, 1). To prove Thcorcm l6.2l we must prove, for some constant K = K v , that for 
every n £ N, there is an oriented graph D on n (non-isolated) vertices with bias^D) < 
Kn/ log 2 n. For notational convenience, we in fact prove the bound bias n (D) < K\n/V\ 
where I = Llog 2 (n)J . 

We now define the random oriented graph D. 

Let V := {1, . . . , n}. We partition V into subsets V\, . . . , Vi of approximately equal 
size. Let n = + q, where q is an integers < q < I. We denote r := Let 
V\, . . . , Vi be a partition of V such that: 

ji 

\Vi\ = r fov i = l,...,q and \Vi\ = |_— J for i = q + 1, . . . , I. 

Let G be the random simple graph on vertex set V defined by including each edge xy, 
x £Vi, y £ Vj, in G with probability l/2 ,+J ' -1 , independently of all other edges. Let D 
be obtained from G by orienting its edges at random. 

Basic Properties of D. 

(i) Each arc xy, x € Vi, y € Vj , i ^ j , has probability l/2 l+: > of being an arc of D. 

(ii) The event xy £ E(D) is independent of the event uv € E(D) so long as {u, v} ^ 
{x,y}. Furthermore, the event xy £ E(D) is independent of all events F generated by 
the events uv G E(D) : {u, v} ^ {x, y}. 

(Hi) While there is dependence between the events xy € E(D) and yx £ E(D) (e.g., 
they are mutually exclusive), the bounds < P(x|/ £ E(D) \ F\ < l/2 i+J_1 hold for any 
event F generated by the events uv £ E(D) : (u, v) ^ (x, y). 

Showing that, with high probability, D has no isolated vertex is relatively straight- 
forward, and may be easily checked by the reader. 

We introduce now some new notation. We set 8 = 1 — r], 7 = (l + r])/2, k = \\og 2 
and K = K v = [(32 + 2k)2 8 S~ 2 ~\. An orientation F of an n-cycle has bias v (F) < n. 
This example suffices for all n such that n < Kn/ \og 2 n, i.e., n < 2 K . Fix n > 2 K , 
and let D be the random oriented graph defined above. The proof of Theorem 16.21 is 
completed by proving that with high probability bias v (D) < Kr (recall that r = [~j])- 

Thus, our task is to bound the probability that there is a pair A, B C V with 
e(A,B) > Kr and e(B,A) < ne(A,B). If we consider this event separately for each 
pair A, B, we end up considering far too many events. So we restrict our attention to a 
certain subset of this family of events. We define a family 1Z = lZ(n) of pairs (R, R*) of 
subsets of V such that each biased set E(A,B), for A and B subsets of V, provide two 
elements (R, R*) and (T, T*) of 1Z which do not belong to four type of events, concerning 
pairs of elements of 1Z. This will then reduce the problem to bounding the probabilities 
of these four events; we prove then these bounds and conclude the proof of Theorem 16.21 

To define the family 1Z below, we first need to associate an (increasing with respect 
to inclusion) function defined on the whole family of subsets of V. For a subset R of V, 
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and i = 1, ... ,1, let B4 := Rf] Vi, rt := \Ri\, and define s(R) by 



Note that since each is bounded by r = \j] , s(i?) itself is strictly smaller than r. 
Notice also that if R C A, then s(-R) < s(A). For j = 1, . . . , 21, consider the collection 
of sub-intervals Ij of [0,r) defined by 



By the choice of I (as the largest integer not exceeding log 2 n) , we infer that for every 
non-empty subset R of V, there is a j such that s(R) g Zj. 

For each j = 1, . . . , 21, we denote by 72j the family of pairs of subsets (R, R*) such 
that R C Ui<3+jfe ^ -R* = -RU Uoi+jfc ^ and s ( i? ) e Z j- Formally, 



72^ := { (i?,i?U |J Vi 

t>j+fc 



3+fc 



i? C |J Vi and s(i?) g 



i=l 



-0} 



The family 72. is defined as the union of all the sets 72. j, i.e. 

2/ 

ft := (J K 3 . 

j'=i 



The importance of the family 72. defined above is apparent in the following two 
lemmas, namely that, the pairs (R,R*) € 72. "cover" every non-empty subset of V, and 
the size of 72 is sufficiently small. 

Lemma 6.7. For every non-empty subset A C V , there is an element (i?, R*) g 72. such 
that RCACR* 

Proof. Let 1 < j < I be such that s(A) g [2" J r, Define R = \J i<j+k (Vi n A) 

and i?* = R IJ, v -- Sincc Ei>i+fe+i < 2- j ~ k r, it follows that 2~i- 1 r < 

s(R) < 2-i +1 r, i.e., (R, R*) g 72.^ C 72., and the claim follows. □ 

Lemma 6.8. \K\ < cxp((fc + 8)r) 

Proof. It suffices to prove the bound \TZj\ < cxp((fc + 7)r) for all 1 < j < 2L Fix such 
a j. We have to prove that there are at most cxp((fc + 7)r) sets R C Ui=i f° r which 
s(i?) € p^^V, 2~-' +1 r). In fact we will only use the upper bound s(R) < 2~ J+1 r to 
bound the cardinality of 72^ . 

Define Ri := ROVi so that r, = \Ri\. First remark that the bound s(R) < 2~ J+1 r 
immediately implies that 

n < 2 i+1 ~ j r for all i < j + k. (5) 
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What we shall actually prove is that the number of (j + fc)-tuples (Ri, . . . , Rj+k) with 
the property that Ri C V i and r, < 2 l+1 ~^r, is bounded by exp((A: + 7)r). For i 
j \ ..... j ■ h. the inequalities in ([5]) are automatically verified, i.e., Rj-i, Rj, ■ ■ ■ , Rj+k 
can be any subset of Vj-i, Vj, . . . , Vj+k, respectively. There are 2^ k+2 ^ r < exp((fc + 2)r) 
ways to choose Rj-i, Rj,.. . , Rj+k- 

The number of ways to choose the subsets i?i , . . . , iij-2 can be bounded as follows. There 
are at most r^ 2 < r l < exp(r) ways to choose the sequence of set sizes (ri, . . . , and 
for each choice of (n, . . . , r^-a), the number of sets (i?i, . . . , Rj_ 2 )) with corresponding 
cardinalities is bounded by the product of binomial coefficients V \ ') for i = 1, . . . , j — 2. 
By well known properties of binomial coefficients, and since rj < r/2 by inequalities 
([5]) for i < j — 2, this latter product takes its maximum exactly when each takes its 
maximum allowed value, i.e., for r, = 2 i+1_:, r for each 1 < i < j — 2. We infer that the 
number of (J — 2)-tuples (Ri, ■ ■ ■ , Rj-2) with cardinality constraints above is bounded 

by 

3—2 / \ 3-2 2 i + 1_j r 

exp(r) I 2 , +1 _. J < exp(r) [] (e-^— ) 
i=l v 7 i=l 

< exp(r) exp^2 J '~ 1 ~v) exp^(« + I- j)2 j ~ 1 ~ l r^j < cxp(5r), 

i=l i=l 

and the lemma follows. Note that in the first inequality above, we used the well known 
bound (?) < (ea/b) b on binomial co-efficients (using the fact that x\og(y/x) < y/xy for 
all positive reals x,y). □ 

Lemma 16.81 shows that the number of pairs (R,R*),(T,T*) £ 1Z is bounded by 
exp((2fc + 16)r). This is the main property we will use in the following. Namely, the 
four type of events that we are going to define below concern pairs of elements of TZ, and 
have the property that the probability that one of these event happens for a given pair 
of elements in 1Z, is bounded by exp(— (32 + 2fc)r). The bound on the total number of 
pairs then show that with (high) positive probability (as function of r) non of the four 
events happen. This will then complete the proof of Theorem 16.21 since as we said, we 
will show that the event there exists a biased E(A,B) is contained in one of these four 
events. 

In the following, we use the notation e(R, T) for the number of edges between R 
and T in the underlying graph G, so that e(R,T) = e(R,T) + e(T,R). We also write 
e(R, R*;T, T*) for e(R*,T*) - e(R, T). Consider the following four events concerning a 
given pair of elements (R, R*) and (T, T*) in 1Z. 

Fi: For a given pair of elements (R, R*), (T, T*) £ TZ with s(R)s(T) > Kr/A, 

e(R,T) < s(R)s(T)/2. 
F n : For a given pair of elements (R, R*), (T, T*) £ TZ with e(R, T) > Kr/8, 

e(T,R) <je(R,T). 
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F m : For a given pair of elements (R, R*), (T, T*) £ TZ with s(R)s(T) > Kr/4, 

e(R,R*;T,T*) > Ss(R)s(T)/A. 
Fiv: For a given pair of elements (R, R*), (T, T*) G TZ with s(R)s(T) < Kr/4, 

e{R*,T*) > Kr. 

The following lemma shows us that the event of interest to us is contained in the 
union of the above four type of events for pairs of elements of TZ, and so combining the 
bounds on the probabilities of F* with Lemma fc.81 will conclude the proof of Theorem 
IO 

Lemma 6.9. The event that there is a pair A, B C V with e{A, B) > Kr and e{B, A) < 
r)e(A,B), is contained in the union Fi U Fn U Fm U Fiv over all pair of elements 
(R,R*),(T,T*) oflZ. 

Proof. Suppose none of the events Fi, Fn, Fm, Fiv occur for any pair (R, R*), (T, T*) 
in TZ. We have to show that there is no pair A,BCV with e(A, B) > Kr and e(B, A) < 
r]e(A,B). For the sake of a contradiction, suppose A,BC V be such a (biased) pair. 
Applying LemmalO let (R, R*) and (T, T*) be two elements of TZ such that R C A C R* 
and TCBC T*. We will consider the four events F* above for the pair (R, R*), (T, T*), 
and the pair (T, T*), (R,R*), and derive a contradiction. 

• If s(R)s(T) < Kr/A, then, since Fiv does not occur, we should have e(A, B) < 
e(R*,T*) < Kr, contradiction. 

• If, on the other hand, s(R)s(T) > Kr/4, then, since none of Fi, Fn, Fm occur for the 
pair (R, R*), (T, T*), and the pair (T,T*), (R,R*), we have that e(R,T) > s(R)s(T)/2, 
that e(T,R) > je(R, T), that e(R, T) > je(T, R), and that e(R, R*;T, T*) < Ss(R)s(T)/4 

It follows that 

e(B,A) > e(T, R) > l±l e {R,T) = ^{e(R*,T*) - e(R, R* ;T,T*)) . 

Note that e{R,T) = e(R,T) + e(T,R) < (1 + ±)e{R,T). We infer that e(R,T) > 
^e(R,T) > e(R,T)/2 > e(R)e(S)/4. From the bounds e(R*,T*) > e(R, T) > 
s(R)s{T)/4 and e(R, R*;T,T*) < 5s(R)s{T)/4:, we deduce that 

£{R,R*;T,T*) < 5e(R*,T*)/2, 

and so 

e(B,A) > l±l (i _ ^je(R*,T*) > r ]e (R*,T*) > r)e(A, B) , 
yielding again to a contradiction. □ 

So to complete the proof of Theorem 16.21 by the choice of K, we are only left to prove 
the following lemmas. 
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Lemma 6.10 (Bounding P(Fi)). For any pair (R, R*), (T, T*) e TZ with s(R)s{T) > 
Kr/A, 

P (e(R, T) < s(R)s(T)/2^j < exp(-A>/32). 

Lemma 6.11 (Bounding P(F n )). For any pair (R,R*),(T,T*) G TZ with e(R, T) > 
Kr/8, 

' e(T, R) < je(R,T)) < exp(-Kr5 2 /128). 



Lemma 6.12 (Bounding P(F m )). For any pair (R, R*), (T, T*) G TZ with s(R)s(T) > 
Kr/A, 

P ( e(R, R*;T, T*) > 5s{R) S {T)/A ) < exp ( Z ^) • 

Lemma 6.13 (Bounding P(Fi V )). For any pair (R,R*), (T, T*) G TZ with s{R)s(T) < 
Kr/A, 

w(e(R,T) >Kr\ < exp(-A>/3). 

Proof of Lemma \6.1(A Recall that G is the random simple graph underlying D. The 
random variable e(R,T) may be expressed as ^2{ x y }- x< zR : v ^t l{i.y}6£(G)i a sum °f m ~ 
dependent {0, l}-valued random variables. Its mean may be bounded below by 

1 1 r-t- Kr 
E(e( J R,T))>^^^= S ( J R>(T)>^. 

i=l j=l 



And so, by Chernoff's inequality, Lemma 16.61 we have 

P (e(R,T) < s(R)t(R)/2^j < exp(-s(R)s(T)/8) < exp(-A>/32) 



□ 



Proof of Lemma\(nj\ Let (R, R*),(T,T*) £ TZ be such that e(R,T) > Kr/8. Let 
R' = R \ T, let V = T \ R, and let S = R n T. We may express e(i?, T) as e(R', T') + 
e(R', S) + e(S, S) + e(S, T'), and e(T, R) as e(T', R 1 ) + e(T', S) + e(S, S) + e(S, R'). Let 
us introduce the notation e'(R, T) = e(R', T') + e(R', S) + e(S, T') = e(R, T) - e(S, S), 
and similarly e'(T,R). Let e'(R,T) = e'(R,T) + e'(T,R). A few lines of calculation 
show that the event that e(T, R) < je(R, T) is exactly the event that (1 + j)e'(T, R) < 
je'{R,T) - (1 - 7 )e(5,5). The mean of e'{T,R) is n = e'(R,T)/2, and so the above 
inequality becomes 

V 1 + 7 [i 

Setting (3 = (1 — 7)/(l + 7) + (1 — r y)e(S, S)/ /j,, an easy calculation shows that (3fi > 
(1 — -f)e(R, T)/(l + 7), and so, from Chernoff's inequality, we deduce that 

F(e<T,K) < je(R,T) ) < exp (- (^)^) < cxp < ~ Krfi 

< exp 



1 + 7/ 2 J ~ * V64(l + 7) ; 
-KrS 2 * 
128 



□ 
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Proof of Lemma WM Let (R,R*),(T,T*) e 11 be such that s(R)s(T) > Kr/4. The 
definition of K implies that s(R* \ R) < 6s(R)/32, likewise s(T* \ T) < §s(T)/32. 

(Note that if {R,R*) G then s(iT \ i?) < 2^'-*r = 2-J- 1 r5/32 < s(R)S/32.) 

We let P be the set of pairs with one end point R* and the other in T* which 
do not have one end in R and the other in T. We may bound the expected value of 
e(R,R*;T,T*) by 

V- Pl1 . ^ nt*, + r*U> + r*£*, 
2^ P(l{ U ,t.}€B(G)) < 2^2^ ^T^I ■ 

{u,v}eP i=1 »' =1 

The latter sum has value 2(s(R)s{T*)+s(R*)s{T)+s(R*)s(T*)) < (S/8+5 2 / 512) s(R)s{T), 
and so E(e(R,R*;T,T*)) < 3Ss{R)s(T)/8. Applying Chernoff's inequality, LemmaES 
with A = 3Ss(R)s(T)/ 16 and ^ = 1/2 we obtain the bound 

□ 

Proof of Lemma\KTM Let (R,R*),(T,T*) G ft be such that s(R)s(T) < Kr/4. An 
upper bound for e(i?, T) is given by S{u,o}:uGfl,«eT l{u,t>}e-E(G)- This is a sum of inde- 
pendent {0, l}-valued random variables, and so, by Chernoff's inequality (Lemma I6.6p . 
it suffices to prove that its expected value is at most Kr/2. This is easily proved, 

E m^}e E( G)) <EE ^Sfer = mr)s(t) < ^ . 

uei?,ueT i=l i'=l 

□ 

6.3 Algorithmic approach 

Proposition l6.3l implies that in every regular oriented graph D there are subsets A,BCV 
with e(A, B) > n/4 and e(B,A) = 0. We now give a polynomial time algorithm which 
can find a pair A,BCV with e(A, £?) > n/4 and e(S, A) = 0. 

Our algorithm will begin by building a sequence of sets A\, . . . , A n for which e^(A) 
is not too large. It then selects a set to be A and from that defines B. 

We now analyse the algorithm. 
Lemma 6.14. For each t = 1, . . . , n — 1 the set A t satisfies e^ 2 \A t ) < d 2 (t 2 — l)/n. 

Proof. This is clearly true when t = 1. Suppose the bound fails: let t be minimal such 
that e^ 2 \A t +\) > d 2 ((t + l) 2 — l)/n. Note that, by the definition of A t +\, one has that 

eW(A t+1 ) = e W(A t )+ min e^(A t ,{u}). 
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l: Let t = 1, let v be an arbitrary vertex and let A\ = {v}. 

2: while t < n do 

3: Select a vertex u G V \ A t minimising e^ 2 >(A t , {u}). 

A-. Set At+i =A t U {u}. 

5: Increase t by one. 

6: end while 

7: Let t = \n/2d\ 

8: Let A — A t 

9: Let B = {v : e({v}, A) = 

10: return A, B 



Combining this with the observation that the sum of e^ 2 > (At, {u}) over vertices u G V\A t 
is precisely e^(A t ,V \A t ) = 2d 2 t - 2e {2) (A t ), we obtain that 

2d 2 t - 2e^(A t ) > (n - t)(e^(A t+1 ) - e^(A t )) . 

Straightforward calculation, and the use of the bounds we are assuming on e^ 2 '(A t ) and 
e^(A t +i), yield that t > n — 2. Hence there cannot exist t G {1, . . . ,n — 1} for which 
the bound fails. □ 

Let us define a quadratic equation f(t) = dt — d 2 (t 2 — l)/n. This quadratic obtains 
its maximum at n/2d. 

Proposition 6.15. Given a d-regular oriented graph D. The above polynomial time 
algorithm finds subsets A, B C V satisfying e(A, B) > n/4 and e(B, A) = 0. 

Proof. Let t = [n/2d\. The algorithm outputs A = A t and B = {v : e({v}, A) = 0}. It 
is immediate from the definition of B that e(B, A) = 0. We now prove the bound on 
e(A, B). Let us note that each arc from A to V \ B is the first arc of a path of length 
two from A to A. Thus e(A,V\B) < e^(A). And so e(A, B) = e(A,V)-e(A,V\B) > 
d\A\ — eS 2 \A). We now use the fact that A is obtained as A t . This implies that \A\ = t, 
and, by Lemma \6. 141 that e^(A) < d 2 (t 2 — l)/n. Substituting these values we obtain 
that e(A, B) > f(t). Since / is a quadratic with its maximum at n/2d its value in the 
range (n/2d— l,n/2d] is always at least f(n/2d— 1). A simple calculation shows that 
f(n/2d — 1) = n/4, completing the proof. □ 

7 Concluding Remarks 

We would very much like to see a proof of Conjecture 11.41 By Theorem ll.il the k = 2 
case of this Conjecture is settled. Theorem 11.21 proves that every oriented graph with 
bias(D) < ee(D) 2 /n 2 contains an oriented six-cycle. This is weaker than the k = 3 case 
of the conjecture, which requires that every oriented graph with bias(D) < ee(-D) 3 / 2 /n 
contains an oriented six-cycle. If this result does hold then it is best possible. One can 
prove this by considering a random orientation of the largest (with respect to number 
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of edges) simple graph (on n vertices) with girth greater than six, further examples 
being obtained as blow-ups of these first examples. The fact that these examples have 
bias(D) = 0(e(D) 3 / 2 jn) relies on the fact that there is a constant c > such that for 
all n, there is a simple graph G on n vertices which has girth greater than six and such 
that e(G) > crc 4 / 3 . Such graphs may be constructed using the rank two geometries 
introduced by Tits [22] . 

Bondy and Simonovits [5] proved that a graph with girth greater than 2k has 
(J(n 1+1 / k ) edges. It is widely believed that this result is best possible, i.e., it is be- 
lieved that for each k > 2, there is a constant Ck > such that for all n, there is 
a graph on n vertices with girth greater than 2k and with e(G) > Cfcn 1+1 / fc . This is 
known to be true for k = 2, 3, 5. If it is true for all k, then by considering blow-ups of 
random orientations of graphs with girth greater than 2k, we can find oriented graphs 
with bias(D) = 0(e(Z?) fe /( fe-1 )/n 2 /' fc-1 ') which contain no oriented 2fc-cycles. In this 
case Conjecture 11.41 if true, is best possible. 

Our final remark on Conjecture 11.41 is that it seems unlikely that a straightfor- 
ward adaptation of our approach will give a proof. Our proofs focus on what hap- 
pens locally to a given vertex. We examine the arcs from the out-neighbourhood 
of a vertex to the second out-neighbourhood. However, there are oriented graphs in 
which each component contains only e{D) 2 /n 2 arcs, and so a condition of the form 
bias{D) < £ e(D) fc /(fc-i)/ n 2/(fc-i) (for k > 3) gives no information about a particu- 
lar component. Thus, any successful approach to Conjecture 11.41 must go beyond the 
aggregation of certain locally observed inequalities. 

Up to this point we have never explicitly put any condition on the underlying graph. 
What happens if we do? This may change the problem considerably. Certainly the 
examples we have used up to this point have a certain structure to their underlying 
graph as well as to the orientation. One particular case that may be of interest is the 
case in which the underlying graph is random. The most general question that arises in 
this context is the following. 

Question 7.1. Let H be an oriented graph and let p ~ p(n) be some function of n 
(e.g. p(n) = n 1 ! 2 ). Then for which function b(n) do we have the following: with high 
probability, every orientation D of the Erdos-Renyi random graph G(n,p) satisfying 
bias(D) < b(n) contains a copy of H? 

The two functions b{n) and p{n) are certainly correlated. In the extreme case, when 
p(n) goes to sufficiently slowly that there arc (whp) many copies of H is G(n,p), 
one may ask, similarly to the dense case, whether a condition as weak as bias(D) < 
epn 2 ~ ee(G(n,p)) suffices to ensure that (whp) D contains a copy of HI Let H 
be the underlying unoricntcd graph of H with n# vertices and en edges, and with 
upper density 6h '■ this is by definition the maximum density of a subgraph of H, i.e., 
5h ■= max[/cy(_ff) ■ By the result of Erdos and Renyi [11] (for the balanced case) 

and Bollobas [5] (for the general case), there exists a threshold at pn = n for the 
event that G{n,p) contains a copy of H. Also, large deviation and concentration results 
for the number of homomorphic copies of H in G(n,p) around n nH p eH are known (e.g., 
see [20j HH [26j Hi])- This is roughly the expected number of copies of H in G(n,p) 
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modulo a constant factor imposed by the automorphisms of H. It is easy to see that, 
in order to have a result ensuring the existence of H as an oriented subgraph for an 
orientation of a random graph G(n,p) with a condition as weak as bias(D) < epn 2 , we 
should have n nH p eH = Q,(pn 2 ). So, based on some related questions on quasi-random 
unoricnted graphs, it is reasonable to conjecture that the following should be true. 

Conjecture 7.2. Let 8h = maxjjcv(H), \u\>3 and p(n) >> n 5 h . Then with 

high probability the following holds: an orientation D of G(n,p) has an oriented copy of 
H as subgraph provided that bias(D) < epn 2 . 

Indeed, the above conjecture follows from a conjectural version of the embedding 

lemma in the sparse case (c.f., conjecture 8 in [TJ] and conjecture 3.11 of Q3|). This 

can be seen by mimicking the proof given in |T3] of a related conjecture on the values 

i 

of ex(G(n,p), H) in the case pn 5 f — > oo assuming the sparse version of the embedding 
lemma (c.f., Conjecture 3.15 and Theorem 3.16 in 

We now discuss other parameters that may be of interest. Can one ensure the 
presence of an oriented four-cycle by putting an upper bound on max{e(A, B) — e{B, A) : 
A,BC V}? The answer is of course "Yes", as it follows from Theorem 11.11 that an 
upper bound of the form ee(D) 2 /n 2 suffices. Surely this is not best possible. What is 
the largest upper bound that suffices? Another parameter that one might consider is the 
size of the largest eigenvalue. What is the largest upper bound on the largest eigenvalue 
that can ensure the presence of an oriented four-cycle? Finally, another parameter that 
may be of interest is the size of the largest one-way subgraph (c.f. Section let 
ow(D) — m&x{e(A, B) : A,B C V , e(B,A) = 0}. Can an upper bound on ow(D) 
ensure the presence of an oriented four-cycle in Dl It is quite possible that a bound of 
the order of e(D) 2 /n 2 suffices. 

Conjecture 7.3. There exists a constant s > such that every oriented graph D with 
ow(D) < ee(D) 2 /n 2 contains an oriented four-cycle. 
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A Dense Case Using Szemeredi's Regularity Lemma 



In this appendix, we prove the following theorem. 

Theorem A.l. Let h £ N be a fixed integer and H be an oriented graph on h vertices. 
Then for every v £ (0, 1), there exists real numbers e > 0, < (3 — (3(H) < and 
an integer N such that every oriented graph D on n > N vertices, with /3n 2 arcs with 
bias v {D) < ee, contains H as a subgraph. 

When H is an orientation of a bipartite graph, (3 can be any arbitrary small positive 
value. 

The proof of Theorem IA.1I is based on the use of Szemeredi's regularity lemma, so 
we need first to state it. Let us introduce the following notation: Let U and W be two 
disjoint subsets of the vertex set V of a given (oriented) graph D. The number of edges 
between U and W without taking into account the orientation, if any, is denoted by 
e(U, W). Recall that the density of the pair U, W is 

Definition (Uniform Pairs). Let a £ (0, 1) be a real number. The pair (U, W) is called 
a-uniform if, for all subsets V C U and W C W with \U'\ > a\U\ and \W'\ > a\W\, 
we have \d(U', W) - d(U, W)\ < a. 

A simple consequence of the a- uniformity condition above is that bipartite subgraphs 
induced by uniform pairs must be roughly regular. More precisely we have the following 
lemma (the proof of which can be found for example in [5] ) . 

Lemma A. 2. Let (U, W) be a a-uniform pair and let d(U, W) = d. Then we have 
\{u £ U : \T(u) n W\ > {d - a)\W\}\ > (1 - a)\U\ 

and similarly 

\{u £ U : |r(u)n W| < {d + a)\W\}\ > (1 - a)\U\. 

Definition (Equipartitions). An equipartition of V(G) into in parts is a partition 
Vi,...,V m such that L^J < \Vi\ < ^1, forl<i< m, where n = \V(G)\. The 
partition is a-uniform if (Vi, Vj) is a-uniform for all but c(™) pairs 1 < i < j < m. 

Lemma A. 3 (Szemeredi Regularity Lemma |25j). Let a be a real number in (0, 1) and 
let I and N be two natural numbers. Then there exists L = L(l,a) such that every graph 
with at least N vertices has a a-uniform equipartition into m parts for some I < m < L. 

We are now ready to prove the following directed version of the embedding lemma 
in the presence of an appropriate condition on the bias. 

Lemma A. 4. Let h and m £ N be two fixed integers with m > h, H be an oriented 
graph on h vertices, and v a real number in (0,1). Let D be a digraph with e arcs, 
containing disjoint subsets V\, . . . , Vh of size u or u + 1 of the vertex set V such that 
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(ii) For each arc (vi,Vj) G E(H), (Vi,Vj) is a-uniform; 

(Hi) For each i ^ j with G E{H), d(Vi,Vj) > rj for some positive real number 

r 1 >0; 

(iv) For each i ^ j with (i,j) G E(H), bias^(D{Vi U Vj)) < ee, where e = — | . 

Furthermore, suppose that (y^x-ct) > (h + 1) a, and finally au > 1. Then D contains 
an oriented subgraph isomorphic to H . 

Proof. Let {v\, . . . , Vh] be the set of vertices of H . We prove by induction on h that the 
following property holds for alii = 0, . . . , h: 



Property V(l): There exists a sequence x±, . . . ,xi so that 
I. for alii = 1, . . . , I, X{ G Vf, and 

II. for every j, I < j < h, there is a set C Vj of potential candidates for Xj at stage 
I of size larger than ( y^t — cf) n ^' 1 'u. More precisely, if we set N~ (j, I) := { Xi | 1 < 

i < landviVj G E(H)}, N + (j,l) := {x t \l < i < landVjVi G E(H)}, andiV(j,/) = 
N~(j, I) U N + (j, /), then for any y 3 G Xj and for every x l G N(j, I), x,yj G E(D) 
or yjXi G -E(-D) provided that Xi G N~(j,l) or Xj G N + (j,l) respectively, and 

moreover, > - a) N ^u. 

i 

It is clear that the property 'P(O) holds, for this we just take X° = Vj. 

Suppose now that V{1) is true, we will prove that V(l + 1) also holds. 

Let T + be the set of indices j for which we have an outgoing arc vi + \Vj from u; + i 
to Vj, i.e., 

T+ ={j>l + l\vi +1 Vj€E(H)}. 
Similarly, we define T~ for the set of incoming arcs: 

T~ = {j>l + l\vjV l+1 £E(H)}. 

We should select xi+i G X l l+1 in such a way to have arcs from xi+i to the vertices of 
the set Xj +1 in the appropriate direction, and such that updating Xj to X l } +1 we respect 
the condition II of the property P(l + 1). In the following, let Yj be the set of vertices 
of X l [+1 which does not have enough arcs to Xj and hence are not good candidates for 
xi+i. More precisely, for each t G T + (resp. for each t G T~), we define the set Y t as 
follows 

Y t := { y e x\ +1 1 |r+(y) n x\\ < (-^ - a)\X\\ } 
(resp. Y t := { y G | |r"(y) n X<| < (-^T - } ) . 
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By property V(V) and the hypothesis, we have: 

\X\\ > (j^j a) N ^\V t \ > (^Lj af\V t \ 

V V 

> (h + l)a\V t \ > a\V t \ > au . 

It follows from Definition [21 the fact that bias v {X\ U Y t ) < ee, and the choice of 
parameters that for all t £ T + U T~, < <r|Vz+i|. To see this, suppose for the sake 
of a contradiction that this is not the case, i.e., \Y t \ > a\Vi+i\. By the choice of e and 
\Vi+i\ > ^, this means that |Yt||-X^| > a 2 u 2 > ^n 2 > ( ,/ j) ee. This means that 

€e<(^-a)\Xl\\Y t \. 

From now on without loss of generality we will suppose that t £ T + (the other case 
follows similarly). Two cases can happen: 

. iie{X\,Y t ) <ee, then we have e(X|,y t ) < - a)\X l t \\Y t \ < I^-a)!^^. 
By the definition of Y tl we also have e(Y t , X\) < (y^x — cr)\Xl\\Yt\. It follows that 

d(XLY t ) = ^yiViZ^ < f] — a. Wc saw that X\ > <r\Vt\ and by our assumption wc 
have Y t > ct|V]+i|. These all will provide a contradiction to the cr-rcgularity, since 
d(V t ,V l+1 )>ri. 

• if e(X l t ,Y t ) > ee then by the condition on the bias, e(Y t ,X\) > ve(X l t ,Y t ) 1 and 
this means that e(X l t ,Y t ) < \e(J t ,X\) < Hj^t - o)\X l t \\Y t \, So 

e(X l t ,Y t ) = e{X\,Y t ) + e{X l t ,Y t ) < (1 + + l/v) - a)\X\\\Y t \. 

This implies that d{X\,Y t ) < (1 + l/u)(rj/(l + I/7) - a) < (r) - a) and this be 
again a contradiction as in the first case. 

We infer that Y t < a\Vi +1 \. Therefore \X l l+1 \ (Ui eT - uT +r*)| > (r)/(l + l/v) - 
a)m+U)\ Vl+l \ - \T- U T+\a\V l+1 \ > ((77/(1 + l/v) - a) N ( l +^ - ha)\V l+1 \ > au > 1, 
and so the set X\ +1 \ {^teT-uT+Y 1 ) is not empty This means that we can select 
.t; + i £ X l l+1 \ (U t£ T-uT+y*) in such a way to respect the condition I of 1^(1 + 1). By the 
choice of xi+\, it is easy to verify that by taking X\ +1 = X\ n T(xi+i) for t £ T~ U T + 
and X\ +1 = X\ for t £ T~ U T+, we respect also the condition II of V{1 + 1). And 
so V(l + I) is true. The above induction shows that V(l) holds for all 1,0 < I < h, so 
Xi, . . . ,Xh can be found as desired. These vertices form a copy of H . □ 

Proof of Theorem \A.l\ We consider an oriented graph H on h vertices, a real number 
v £ (0,1), a sufficiently large integer number n (n > N, for N being fixed later), a 
digraph D on n vertices with e = j3n 2 (/? to be fixed later) arcs whose bias is at most 
vee (e to be fixed later). In what follows, for an oriented graph D, we denote by D the 
corresponding non-oriented graph obtained by forgetting the orientation of D. 

We now apply Szemeredi's regularity lemma to D for I and some a to be fixed later. 
We set / sufficiently large but fixed such that a graph G on at least I vertices with 
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(1 — a) (2) edges contains a copy of H as subgraph. We obtain a cr-uniform equipartition 
of V(G) into m parts Vi, . . . , V m with I < m < L. We require ft to satisfy: 

2 2 

/3n > t?tt H + exfm, #)(—), 

2 m m 

for some small enough 77. When iJ is non bipartite, ex(m, h) > (1 — cr)( 2 ), and it gives 
a lower bound on (3 in terms of 77 and a (as we will see 77 and cr are correlated in such a 
way that 77 small enough will imply a small enough and a < 77). When 77 is bipartite, 
we can choose /3 as small as we want as long as rj is also chosen small (it just implies 71 
to be big enough). 

The idea is that from this partition we construct a graph G, in which each vertex 
represents a part Vi of the corresponding partition. Two vertices of G are linked together 
if the corresponding parts have a density of at least 77 in between them (i.e., (vi,Vj) € 
E(G) iff d(Vi,Vj) > 77) for some < 77 < 1 which will be chosen according to the 
hypothesis of Lemma lA.41 As long as 77 is not too big, there is some /3 that ensures us 
that e(G) > ex(m, H). Consequently we get a copy of H in G, wlog it uses the vertices 
Vi,...,Vh, Vi € Vi, and we add m — h vertices Vh+i € Vh+i- 

We will apply Lemma IA.4I with appropriate values of e, 77 and a. Wc fix / large 
enough and we choose 77 small enough such that a graph G with (1 — t?' 1 )(™) edges 
contains a copy of H for every m > I. Once I and 77 are fixed, we choose a such that 
(jtt ~ 17 ) ^ (h + l)* 7 - Remark that such a positive er always exists and a < 77. To see 

this, remark that the continuous function f{x) = (y^x ~ x^J h — (h + l)x has a positive 
value at a; = and so for some small x close to zero it will still remain positive. We 
apply Szemeredi's regularity lemma to a and I and D on n vertices with fin 2 edges (/3 
chosen as above with respect to this choice of a and 77) for n large enough. There exists 
L such that a partition to m parts exists with I < m < L. We choose N large enough 
such that <Jj->1. Remark that u > ^ in Lemma [A.4I so that era > 1. Let us choose 

£ = ^5 • We claim that for these choices of parameters every oriented graph D 

on 71 > TV vertices with bias(D) < vee contains a copy of H . Let V±, . . . , Vh be the parts 
of the equipartition which provide a copy of H . We apply lemma [A~4l to this data. We 
should verify that the conditions of lemma are verified: 

(i) We have easily u > — ; 

(ii)&(iii) For each arc £ H, the pair (Vi,Vj) is cr-uniform and d(Vi,Vj) > 77. This is 
true because we find the copy of H in the graph G formed by the pairs which were 
cr-uniform and have the required density condition; 

(iv) For each arc bias v {D(Vi,Vj)) < ee, where e = ^ . This is true 

because D has this property, and so D(Vi, Vj) inherits this property from D. 

Furthermore, the conditions (y^x — a ) k — Q 1 + 1)°" an( i cru > 1 arc also verified. So we 
can apply Lemma [A. 41 to find a copy of H in D. □ 
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A result from N. Alon and A. Shapira [T] says that if we have an oriented graph such 
that we have to remove en 2 arcs before it becomes H-free, then it has many copies of 
H . The exact statement of their theorem is the following: 

Theorem A. 5 (Alon and Shapira [I]). For every fixed e and h, there is a constant 
c[e, h) with the following property: for every fixed digraph H of size h, and for every 
digraph D of a large enough size n, from which we need to delete en 2 arcs to make it 
H-free, D contains at least c(e,h)n h copies of H . 

From this theorem and Theorem lA.il we may deduce the following Corollary: 

Corollary A. 6. For every h, every oriented graph H on h vertices, there exist e > 0, 
< /3 < |, c(e,h,/3) and an integer N such that every digraph D on n > N vertices, 
with more than fin 2 arcs and with bias(D) < ee contains c(e, h)n h copies of H . 

Proof. (Sketch of proof) Let h be an integer and H an oriented graph on h vertices, 
Theorem lA.il gives e, f3 and N such that every digraph D on n > N vertices, with fin 2 
arcs and with biaso.4{D) < ee contains H as a subgraph. 

Setting e' = e/3/10, every digraph D on n > N vertices, with (/3 + e')n 2 arcs and 
with bias(D) = bias 0.5(D) < ee contains H as a subgraph. On top of this, if we delete 
e'n 2 arcs, the remaining oriented digraph D' still contains H as a subgraph. Indeed D' 
has more than /3n 2 arcs, and one can easily check that bias 0. a{F>') < ee, so Theorem lA.il 
ensures the existence of a copy of H . 

We may now apply Theorem IA. 51 which will imply the existence of c(e', h)n h copies 
of H in D. Notice that if f3 + e' > 1/2, we can change the value of e' to some smaller 
value in order to have a number bellow 1/2, and the reasoning remains unchanged. □ 

In other words, the above corollary says that given h, there exist e>0, 0</3 < \, 
c(e, h, (3) and an integer N such that every digraph D on n > N vertices, with fin 2 arcs 
and with bias(D) < ee contains the correct order of any orientation of any graph on h 
vertices. 
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